<400> 1

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagt 900 

taagcttatt atgacctctc gccgctccgt gaagtcgggt ccgcgggagg ttccgcgcga 960 

tgagtacgag gatctgtact acaccccgtc ttcaggtatg gcgagtcccg atagtccgcc 1020 

tgacacctcc cgccgtggcg ccctacagac acgctcgcgc cagaggggcg aggtccgttt 1080 

cgtccagtac gacgagtcgg attatgccct ctacgggggc tcgtcttccg aagacgacga 1140 

acacccggag gtcccccgga cgcggcgtcc cgtttccggg gcggttttgt ccggcccggg 1200 

gcctgcgcgg gcgcctccgc cacccgctgg gtccggaggg gccggacgca cacccaccac 1260

cgccccccgg gccccccgaa cccagcgggt ggcgtctaag gcccccgcgg ccccggcggc 1320 

ggagaccacc cgcggcagga aatcggccca gccagaatcc gccgcactcc cagacgcccc 1380

cgcgtcgacg gcgccaaccc gatccaagac acccgcgcag gggctggcca gaaagctgca 1440 

ctttagcacc gcccccccaa accccgacgc gccatggacc ccccgggtgg ccggctttaa 1500 

caagcgcgtc ttctgcgccg cggtcgggcg cctggcggcc atgcatgccc ggatggcggc 1560 

tgtccagctc tgggacatgt cgcgtccgcg cacagacgaa gacctcaacg aactccttgg 1620 

catcaccacc atccgcgtga cggtctgcga gggcaaaaac ctgcttcagc gcgccaacga 1680 

gttggtgaat ccagacgtgg tgcaggacgt cgacgcggcc acggcgactc gagggcgttc 1740 
tgcggcgtcg cgccccaccg agcgacctcg agccccagcc cgctccgctt ctcgccccag 1800

acggcccgtc gagggtaccg agctcggatc cactagtcca gtgtggtgga attctgcaga 1860 

tatccagcac agtggcggcc gctcgagtct agagggcccg cggttcgaac aaaaactcat 1920 

ctcagaagag gatctgaata tgcataccgg tcatcatcac catcaccatt gagtttaaac 1980 

ccgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt gcccctcccc 2040 

cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga 2100 

aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga 2160 

cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg tgggctctat 2220

ggcttctgag gcggaaagaa ccagctgggg ctctaggggg tatccccacg cgccctgtag 2280

cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag 2340 

cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt 2400 

tccccgtcaa gctctaaatc ggggcatccc tttagggttc cgatttagtg ctttacggca 2460 

cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata 2520

gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca 2580 

aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag ggattttggg 2640 

gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattaatt 2700 

ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag gctccccagg caggcagaag 2760 

tatgcaaagc atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc 2820

agcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct 2880 

aactccgccc atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg 2940 

actaattttt tttatttatg cagaggccga ggccgcctct gcctctgagc tattccagaa 3000 

gtagtgagga ggcttttttg gaggcctagg cttttgcaaa aagctcccgg gagcttgtat 3060 

atccattttc ggatctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga 3120 

tggattgcac gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc 3180

acaacagaca atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc 3240

ggttcttttt gtcaagaccg acctgtccgg tgccctgaat gaactgcagg acgaggcagc 3300

gcggctatcg tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac 3360

tgaagcggga agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc 3420

tcaccttgct cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac 3480 

gcttgatccg gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg 3540 

tactcggatg gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct 3600 

cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt 3660 

cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg 3720 

attcatcgac tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac 3780 

ccgtgatatt gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg 3840

tatcgccgct cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg 3900 

agcgggactc tggggttcgc gaaatgaccg accaagcgac gcccaacctg ccatcacgag 3960

atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 4020

ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact 4080 

tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata 4140 

aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc 4200 

atgtctgtat accgtcgacc tctagctaga gcttggcgta atcatggtca tagctgtttc 4260 

ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt 4320 

gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc 4380

ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg 4440 

ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct 4500 

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 4560 

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 4620

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 4680 

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 4740 

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 4800 

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcaatgctca cgctgtaggt 4860 

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 4920 

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 4980 

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 5040 

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg 5100 

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 5160 

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 5220

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 5280

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 5340 

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 5400
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 5460

catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat 5520

ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag 5580

caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct 5640

ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 5700

tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg 5760

cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca 5820

aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt 5880

tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat 5940

gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 6000

cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa 6060

aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt 6120

tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt 6180

tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa 6240

gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt 6300

atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa 6360

taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtc 6404



<210> 2 

<211> 6420 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> vector pVP22/Myc-His-TOPO 



<400> 2 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagt 900 

taagcttatt atgacctctc gccgctccgt gaagtcgggt ccgcgggagg ttccgcgcga 960 

tgagtacgag gatctgtact acaccccgtc ttcaggtatg gcgagtcccg atagtccgcc 1020

tgacacctcc cgccgtggcg ccctacagac acgctcgcgc cagaggggcg aggtccgttt 1080

cgtccagtac gacgagtcgg attatgccct ctacgggggc tcgtcttccg aagacgacga 1140

acacccggag gtcccccgga cgcggcgtcc cgtttccggg gcggttttgt ccggcccggg 1200 

gcctgcgcgg gcgcctccgc cacccgctgg gtccggaggg gccggacgca cacccaccac 1260

cgccccccgg gccccccgaa cccagcgggt ggcgactaag gcccccgcgg ccccggcggc 1320

ggagaccacc cgcggcagga aatcggccca gccagaatcc gccgcactcc cagacgcccc 1380 

cgcgtcgacg gcgccaaccc gatccaagac acccgcgcag gggctggcca gaaagctgca 1440

ctttagcacc gcccccccaa accccgacgc gccatggacc ccccgggtgg ccggctttaa 1500 

caagcgcgtc ttctgcgccg cggtcgggcg cctggcggcc atgcatgccc ggatggcggc 1560

ggtccagctc tgggacatgt cgcgtccgcg cacagacgaa gacctcaacg aactccttgg 1620

catcaccacc atccgcgtga cggtctgcga gggcaaaaac ctgcttcagc gcgccaacga 1680 

gttggtgaat ccagacgtgg tgcaggacgt cgacgcggcc acggcgactc gagggcgttc 1740

tgcggcgtcg cgccccaccg agcgacctcg agccccagcc cgctccgctt ctcgccccag 1800 

acggcccgtc gagggtaccg agctcggatc cactagtcca gtgtggtgga attgccctta 1860 

agggcaattc tgcagatatc cagcacagtg gcggccgctc gagtctagag ggcccgcggt 1920 

tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat catcaccatc 1980 

accattgagt ttaaacccgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg 2040 
ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt 2100

cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg 2160 

gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg 2220 

atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctct agggggtatc 2280

cccacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 2340 

ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 2400 

ccacgttcgc cggctttccc cgtcaagctc taaatcgggg catcccttta gggttccgat 2460 

ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 2520 

ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 2580 

gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 2640 

tataagggat tttggggatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 2700 

ttaacgcgaa ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc 2760 

cccaggcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc aggtgtggaa 2820

agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa 2880

ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt 2940 

ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctctgcct 3000 

ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcaaaaagc 3060

tcccgggagc ttgtatatcc attttcggat ctgatcaaga gacaggatga ggatcgtttc 3120

gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat 3180 

tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt 3240

cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac 3300

tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg 3360

tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc 3420

aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa 3480

tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc 3540

gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg 3600

aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg 3660

acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa 3720

atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg 3780

acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct 3840

tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc 3900

ttgacgagtt cttctgagcg ggactctggg gttcgcgaaa tgaccgacca agcgacgccc 3960

aacctgccat cacgagattt cgattccacc gccgccttct atgaaaggtt gggcttcgga 4020

atcgttttcc gggacgccgg ctggatgatc ctccagcgcg gggatctcat gctggagttc 4080

ttcgcccacc ccaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc 4140 

acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc 4200 

atcaatgtat cttatcatgt ctgtataccg tcgacctcta gctagagctt ggcgtaatca 4260 

tggtcatagc tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga 4320 

gccggaagca taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt 4380 

gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga 4440 

atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 4500 

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 4560 

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 4620 

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 4680 

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 4740 

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 4800 

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa 4860 

tgctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 4920

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 4980 

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 5040 

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 5100 

agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 5160 

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 5220 

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 5280

tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 5340 

aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 5400 

tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 5460 

atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata 5520 

cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg 5580 

gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct 5640

gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt 5700 
tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc 5760

tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 5820

tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 5880

aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc 5940

atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa 6000

tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca 6060

catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 6120

aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct 6180

tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 6240

gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 6300

tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 6360

tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 6420



<210> 3 
<211> 12 
<212> PRT

<213> Artificial Sequence

<220>

<223> Peptide linker

<400> 3

Leu Ala Arg Leu Leu Ala Arg Leu Leu Ala Arg Leu
1 5 10

<210> 4

<211> 67

<212> PRT

<213> Artificial sequence

<220>

<223> Conserved sequence of steroid/thyroid hormone
receptor superfamily DNA-binding domain



<221> VARIANT 
<222> (0)...(0)

<223> Xaa is non-conserved amino acids within the 
DNA-binding domain 

<221> VARIANT 

<222> 7, 9, 11, 13, 22, 27, 58, 61, 66 

<223> amino acid residues that are almost universally 
conserved, but for which variations have been 
found in some identified hormone receptors 



<400> 4 

Cys Xaa Xaa Cys Xaa Xaa Asp Xaa Ala Xaa Gly Xaa Tyr Xaa Xaa Xaa
1 5 10 15

Xaa Cys Xaa Xaa Cys Lys Xaa Phe Phe Xaa Arg Xaa Xaa Xaa Xaa Xaa
20 25 30

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Lys
35 40 45

Xaa Xaa Arg Xaa Xaa Cys Xaa Xaa Cys Arg Xaa Xaa Lys Cys Xaa Xaa
50 55 60

Xaa Gly Met
65



<210> 5 
<211> 34 
<212> DNA 



<213> Bacteriophage P1



<400> 5 

ataacttcgt atagcataca ttatacgaag ttat 

<210> 6 
<211> 34 
<212> DNA 
<213> E. coli 

<400> 6 

ataacttcgt atagtataca ttatacgaag ttat 

<210> 7 
<211> 34 
<212> DNA 
<213> E. coli 

<400> 7 

acaacttcgt ataatgtatg ctatacgaag ttat 

<210> 8 
<211> 34 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 8 

gaagttccta ttctctagaa agtataggaa cttc 

<210> 9 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide linker moiety 
<400> 9 

Gly Gly Gly Gly Ser 
1 5 

<210> 10 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety; sequence can be repeated an
indefinite number of times

<400> 10 

Gly Gly Gly Gly Ser 
1 5 

<210> 11 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 






<400> 11 

Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Ser 
1 5 10

<210> 12 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 12 

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Lys Gly 
1 5 10

<210> 13 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 13 

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Ser Gly Ser Thr 

1 5 10 15

Lys Gly 



<210> 14 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 14 

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Lys Gly 
1 5 10

<210> 15 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 15 

Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr 

1 5 10 15

Lys Gly 



<210> 16 
<211> 14 
<212> PRT 

<213> Artificial Sequence 






<220> 

<223> Peptide linker moiety 
<400> 16 

Glu Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Glu Phe 
1 5 10

<210> 17 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 17 

Ser Arg Ser Ser Gly 
1 5 

<210> 18 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 18 

Ser Gly Ser Ser Cys 
1 5 

<210> 19 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 19 

Met Gly Arg Ser Gly Gly Gly Cys Ala Gly Asn Arg Val Gly Ser Ser 

1 5 10 15

Leu Ser Cys Gly Gly Leu Asn Leu Gln Ala Met
20 25 

<210> 20 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 

<221> VARIANT 
<222> (0)...(0)

<223> Xaa is (GmS)n, where m is from 2 to 4 and n is
from 1 to 11.



<400> 20 

Ala Met Xaa Ala Met 




<210> 21 

<211> 16 

<212> PRT 

<213> Drosophila acanthoptera 



<400> 21 

Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys
1 5 10 15



